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DNA SEQUENCES ENCODING CMH-1, A CYSTEINE PROTEASE INVOLVED IN APOPTOSIS 

TECHNICAL FIF -T r p OF THE INVENTION 
5 The invention relates to DNA sequences encoding 

active, acitvatable and inactive forms of CMH-1, a 
cysteine protease involved in programmed cell death. The 
invention further relates to the polypeptides encoded by 
those DNA sequences and antibodies to those polypeptides. 

10 The invention also relates to recombinant DNA molecules 
comprising these DNA sequences, as well as hosts 
transformed with such recombinant DNA molecules. The 
invention further relates to the use of the DNA sequences 
of this invention in gene therapy for both promoting and 

15 inhibiting apoptosis. The DNA sequences of this 

invention are also useful in diagnosing cells which have 
the potential to undergo apoptosis. Finally, the 
polypeptides encoded by the DNA sequences of this 
invention are useful in identifying inhibitors of 

20 apoptosis . 

BACKGROUND OF TfiF INVENTION 

Programmed cell death (apoptosis) plays an 
important role in embryonic development, homeostasis, and 
in diseases such as neurodegenerative disorders, 

25 autoimmune diseases and cancer [R- E. Ellis et al., Ann r 
Rpv. CM! Biol, , 7, pp. 663-98 (1991)]. Apoptosis is 
characterized by a set of cellular events including cell 
shrinkage, chromatin condensation, and DNA fragmentation. 
Activation of proteases is one of crucial events in the 

30 cellular execution of apoptosis (S. J. Martin et al., 
Cell , 82, pp. 349-52 (1995)]. 

■ - - - - Genet-ic- studies- --in-Eke - *»etaa fco d e ^a^oarh^ bdi ti 
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elegans have provided evidence that Ced-3 gene is 
indispensable for cell death during worm development [J. 
Yuan et al., Qsll, 75, pp. 641-52 (1993)]. The CED-3 
protein shares 28% sequence identity to mammalian 
interleukin-lb (IL-lb) converting enzyme ( ICE) that 
cleaves the inactive IL-lb precursor to the 
proinflammatory cytokine [N. A. Thornberry et al., 
Na£ui£, 356, pp. 768-74 (1992)]. Both ICE and CED-3 are 
members of a newly-discovered cysteine protease family 
that also includes mammalian enzymes Tx/ICH-2/ICEreill, 
ICEreUII, Nedd-2/ICH-l, CPP32/Yama/A P opain, and Mch2 [C. 
Faucheu et al., EMBQ s 7.., 9, pp. 1914-22 (1995); N. A. 
Munday et al., J, Pjpj , Chpm,, 270, pp. 15870-76 (1995); 
L. Wang et al . , ££11, 78, pp. 739-50 (1994); S. Kumar et 
al., Gene? n P v,, z 8, pp. 1613-26; T. Fernandes-Alnemri et 
al " J. Bjpl , C\]rm , 269, pp. 30761-64 (1994); T. 
Fernandes-Alnemri et al., Cancer R„ . 55, PP . 2737-42 
(1995)]. Each of these cysteine proteases are composed 
of two subunits of approximately 20 kDa and 10 kDa 
derived from processing of precursor polypeptides. They 
share conserved amino acid residues for catalysis and 
binding of Pi aspartate residue (K. P. Wilson et al . , 
Natura ' 370 ' PP- 270-73 (1994); and N. P. c. Walker et 
al., ££ll, 78, pp. 343-52 (1994)]. ~- 

Overexpression of cysteine proteases in the 
Ice/Ced-3 gene family leads to apoptosis of trans fected 
fibroblast cells fM . Miura et al., c^i, 75, pp. 653-60 
(1993); c. Faucheu et al., EMBQ J. . 9, pp. 1914-22 
(1995); T . Fernandes-Alnemri et al., J. Bio]. rh om 2 69, 
PP. 30761-64 (1994); T. Fernandes-Alnemri et al., Cancer 
Efi^, 55, pp. 2737-42 (1995); Y. Gu et al . , EMBQ J. . 9/ 
PP. 1923-31 (1995); M. Tewari et al . , C^ii, 81, pp. 3255- 
60 (1995)], and ICE inhibitors block cell death of 
neurons deprived of neurotrophic factors [V. Gaglardini 
et ial-^, Science., 2€-3— pp 826-"2T) "( 1994 )t~ and C. E. 
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Milligan et al., Neuron , IS, pp. 385-93 (1995)]. The 
involvement of ICE or related proteases has been 
implicated in Fas- and TNFcc-mediated apoptosis (K. Kuida 
et al,, Science , 267, pp. 2000-03 (1995); M. Enari et 
5 al., Nature , 375, pp. 78-79 (1995); M. Los et al., 

jiatU^ 375, pp. 81-83 (1995); M. Tewari et al., J. B i ol. 
Chentv 270, pp. -3255-3240 (1995); M. Miura et al., Cfiii, 
75, pp. 653-660 (1995) ] . 

Furthermore, ICE deficient mice are resistant 
10 to apoptosis induced by an anti-Fas antibody, suggesting 
a physiological role for ICE in Fas-mediated apoptosis of 
thymocytes (K. Kuida et al, 1995) . 

A number of proteins, including 
poly (ADP-ribose) polymerase (PARP), 70 kDa protein 
15 corqponent-^e^ the Ul small nuclear ribonucleoprotein, 

lamin Bl, fi-actin and topoisomerase I are proteolytically 
degraded during apoptosis. Most notably, the rapid and 
complete cleavage of 116-kDa PARP between Asp214 and 
Gly215 to 31-kDa and 85-kDa polypeptides occurs at the 
20 onset of apoptosis [S. H. Kauffraan et al., Canger Rq$., 
53, pp. 3976-85 (1993); Y. A. Lazebnik et al., N^tyre, 
371, pp. 346-47 (1995) ] . 

Some workers have suggested that CPP32 is the 
cysteine protease that cleaves PARP in a cytosolic 
25 extract of apoptotic cells [D. W. Nicholson et al. 

Nature , 376, pp. 37-43 (1995); Y. A. Lazebnik et al., 
(1995)]. Cytotoxic T lymphocyte (CTL) -specif ic granzyme 
B cleaves and activates CPP32, resulting in PARP cleavage 
in target cells, suggesting a protease activation cascade 
30 involving CPP32 in CTL-mediated apoptosis [A. J. Darmon 
et al., Nature , 377, pp. 446-48 (1995)]. In addition, 
CPP32 cleaves and activates sterol responsive element 
binding proteins (SREBP) that are involved in 
transcription regulation of genes for low density 
35 lipoprotein receptor and 3-hydroxy-3-methylglutaryl CoA 
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Synthase (X. Wang et al., J, PjoJ , , 270, pp. 18044- 

50 (1995)]. * 

Despite these developments to date, other 
members of the Ice/Ced-3 gene family of cysteine 
proteases need to be identified and characterized in 
order to fully understand the mechanisms of apoptosis. 



SUMMARY Of THE TNVf^jnp 

The invention provides a complete cDNA sequence 
encoding a 34-kDa polypeptide that is homologous to 
CPP32 . The full-length cDNA encodes the proenzyme 
Pactivatable") form of a cysteine protease - CMH-1 - 
.that is important in programmed cell death ("apoptosis"). 
The invention also provides cDNA sequences encoding 
active forms of CMH-1, as well as inactive, mutated forms 
of CMH-1. 

The invention also provides recombinant DNA 
molecules comprising any of the above-described cDNA 
sequences. The recombinant DNA molecules of this 
invention are used to transform both prokaryote and 
eukaryote hosts. Such transformed hosts are also part of 
this invention. The transformed hosts may be utilized to 
produce large quantities of the DNA sequences of this 
invention. Additionally, the transformed hosts of this 
invention may be utilized to produce any of the 
polypeptides encoded by those DNA sequences. 

The DNA sequences of this invention may be used 
in both therapeutic and diagnostic applications. The DNA 
sequences encoding a pro-enzyme form or an active form of 
CMH-1 are suitable for use in gene therapy to promote 
apoptosis, especially in cancer cells. In addition, 
those DNA sequences may be utilized to modulate 
cholesfee^l-.leve^s^«^-p^ t rentr^du e -to the ability of • 
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the expression product to activate elements involved in 
the transcription of low density lipoprotein receptors. 

The DNA sequences encoding inactive, mutant 
forms of CMH-1 are useful in gene therapy to prevent 
5 apoptosis by competing with the native, active form of 
the enzyme in the cell. This therapeutic use is 

especira-Hy-useful in preventing neurological diseases. 

The DNA sequences of this invention may be used 
as diagnostic tools to detect and quantify Cmh-1 mRNA 
10 levels in various cells. This method may be used to 
determine the sensitivity of cancer cells to 
chemotherapeutic treatment, A high Cmh-1 mRNA level 
suggests that the cell would be responsive to 
chemotherapy. This method may also be used to detect 
*l-5 diseases in cells that do not normally undergo apoptosis, 
especially cells involved in neurological disease. A 
high level of Cmh-1 mRNA would be indicative of a disease 
state. 

The Cmh-1 DNA sequences of this invention may 

20 also be used to inhibit the expression of CMH-1 

polypeptides in a cell through the use of anti-sense 
technology. Single stranded, antisense DNA can be 
introduced into cells where it can hybridize to and 
inhibit the translation of Cmh-1 mRNA. Such methods can 

25 be used to prevent undesirable apoptosis, such as is 
characteristic of neurological diseases. 

The active CMH-1 polypeptides of this invention 
are useful as reagents for screening potential apoptosis 
inhibitors. The ability of a compound to block CMH-1- 

30 mediated cleavage of either its natural substrate, 

poly (ADP-ribose) polymerase ("PARP"), or an appropriate 
synthetic peptidyl substrate, suggests that such a 
compound is a potential apoptosis inhibitor. 

The invention also provides monoclonal and 

35 polyclonal antibodies to CMH-1. These antibodies, 
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may be used to assay for CMH-1 levels in a patient or 
cell. In addition, CMH-1 antibodies are useful in 
inhibiting CMH-1 activity, and therefore, apoptosis 

BRIEF DES CRIPTION OF THE DRAWINGS 
Figure 1 depicts a comparison of the amino acid 
residues lining each of the substrate side-chain binding 
pockets of various members of the ICE cysteine protease 
family. Amino acid residues that are identical to those 
present in CMH-1 are shown in bold. 

Figure 2, panel A, depicts an immunoblot of 
extracts of COS cells transfected with plasmids 
expressing T7-tagged derivatives of an activated form of 
CMH-1 (Ala24-Gln303) (lane "CMH-lwt"), an inactivated 
mutant of CMH-1 (Ala24-Gln303) (lane "CMH-lmt") , a 
15 truncated PARP (Gu et al., 1995b) alone (lane "PARP"), 

truncated PARP and activated CMH-1 (Ala24-Gln303 ) together 
(lane "CMH- lwt+PARP" ) , or truncated PARP and the CMH-1 
mutant together (lane "CMH-lmt+PARP") . Immunoblot ting 
was carried out using an anti-T7 antibody. Arrows 
20 indicate the position of cleaved p20 CMH-1 . "PARP(T)" 
denotes truncated PARP. «PARP(*)" denotes the tagged 
N-terminal fragment of cleaved PARP. 

Figure 2, panel B depicts the induction of 
apoptosis in transfected COS cells by active or inactive 
25 CMH-1, as determined by internucleosomal DNA 
fragmentation. 

Figure 3 depicts the purification of 
recombinant (His) e -tagged CMH-1 following Ni-affinity 
chromatography. Proteins were detected either by 
Commassie blue staining (lane 1) or by immunoblotting 
with anti- (His) 6 -tag antibody (lane 2). 

Figure 4, panel A depicts the ability of 
recombinant active CMH-1 to cleave PARP in the presence 
stsftee-o-f-the tetrapeptlde aldehyde inhibitor 



30 
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DEVD-CHO (first four lanes). "PARP(T)" denotes the 
uncleaved truncated form of PARP. "PARP*" indicates the 
primary cleavage products (32 and 12 kDa) • 

Figure 4, panel A, also compares the ability of 
5 CMH-1 or ICE to cleave IL-lb precursor (last three 

lanes) . w pIL-lB" denotes the uncleaved precursor. "IL- 

IB" denotes the mature cleavage product. 

Figure 4, panel B depicts the effects of 
various protease inhibitors on the ability of CMH-1 to 
10 cleave "S-labelled PARP in vitro, as analyzed by SDS-PAGE 

and fluorography. 

Figure 5 depicts a Northern blot analysis of 
human Cmh-1 in heart (lane 1), brain (lane 2), placenta 
(lane 3), lung (lane 4), liver (lane 5), skeletal muscle 
-15- (lane 6), kidney (lane 7), and pancreas (lane 8) using 
full length CMH-1 cDNA as a probe. 

p fJATT.ED DESCRIPTION OF TME INVENTION 

The following definitions are used throughout 
the specification and the claims. 

20 "Cmh-1 " refers to nucleotide sequences, either 

DNA or RNA, single-stranded or double-stranded that 
encode a CMH-1 polypeptide. When used to define a 
double-stranded nucleotide sequence, the term also refers 
to the anti-sense nucleotide sequence. 

25 "CMH-1" and "CMH-1 polypeptide" refer to a 

polypeptide which shares at least 80%, and more 
preferably, at least 90% homology with amino acids 29-303 

Of SEQ ID NO: 2. 

An "active CMH-1 polypeptide" is a CMH-1 
30 polypeptide that possesses cysteine protease activity and 
is capable of cleaving PARP by the assay set forth in 
Example 4. Examples of active CMH-1 polypeptides of this 
invention are amino acids 24-303 of SEQ ID NO: and amino' 



WO 97/16552 



PCT/US96/17431 



0 



-8- 

acids 29-303 of SEQ ID NO: 2. 

An "activatable CMH-1 polypeptide" is an 
inactive form of a CMH-1 polypeptide which, upon 
cleavage, is converted into an active CMH-1 polypeptide 
(i.e., a proenzyme form). An example of an activatable 
CMH-1 polypeptide is SEQ ID NO: 2. 

An "inactive CMH-1 polypeptide" is a CMH-1 
Polypeptide that lacks cysteine protease activity, due to 
a mutation, deletion or substitution. Preferred examples 
of inactive CMH-1 polypeptides are those which have been 
mutated at one or more of the following catalytic 
residues: Cysl86, Trp232 or Arg233 present in SEQ ID 



NO: 2 



Additional terms are defined where necessary 
throughout the application. 

According to one embodiment, the invention 
provides an isolated DNA sequence encoding an active, 
activatable or inactive CMH-1 polypeptide selected from 
SEQ ID NO:l; DNA sequences that hybridize to SEQ ID NO:l; 
and DNA sequences which but for the degeneracy of the 
genetic code would hybridize to SEQ ID NO:l or sequences 
which hybridize thereto and code for a polypeptide having 
the same amino acid sequences coded for by SEQ ID NO:l or 
sequences which hybridize thereto. " " 

As used herein, .the term "hybridize to" refers 
to the ability of a denatured DNA sequence to hydrogen 
bond to another denatured DNA sequence through 
complementary base pairs under conditions which allow 
sequences having at least 80% homology to form such 
hybrids. Such conditions are well known in the art and 
are exemplified by salt and temperature conditions 
substantially equivalent to 5X SSC and 65°C for both 
hybridization and wash. 

Although only a single full-length cDNA 
sequence 'is set forth in the attached Sequence Listing, • 
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10 



15 



it will be readily apparent to those of skill in the art 
that any other DNA sequence which encode the same amino 
acid sequence (SEQ ID N0:2:): 
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20 are also be part of applicant's invention. Such DNA 

sequences merely utilize the redundancy of the genetic 
code to encode the same amino acids. While those 
specific sequences are not set forth herein due to space 
considerations, it should be understood that one of 

25 ordinary skill in the art could ascertain all of such DNA 
sequences merely by reference to the genetic code and 
without the exercise of inventive skill. 

In addition to genetically redundant DNA 
sequences, the invention also includes DNA sequences 

30 which encode other amino acid sequences which are at 

least 80%, and preferably at least 90% homologous to SEQ 
ID NO: 2. Because this aspect of the invention does not 
require the isolated DNA sequence to encode an active 
CMH-1, any nucleotides of SEQ ID N0:1 may be modified to 

35 produce a DNA sequence of this invention. The 
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identification and isolation of these additional 
sequences may be achieved by standard DNA library 
screening techniques (hybridization, PGR) using SEQ ID 
NO:l or portions thereof as a probe. Such homologous 
sequences may be found in any mammalian tissue cDNA 
library, as well as in insect cDNA libraries, yeast and 
other fungi cDNA libraries and prokaryotic cDNA 
libraries . 

According to an alternate embodiment, the 
invention provides an isolated DNA sequence which encodes 
an active CMH-1 polypeptide selected from nucleotides 
117-959 of SEQ ID N0:1, nucleotides 132-959 of SEQ ID 
N0:l; DNA sequences which hybridize to either of the 
former DNA sequences; and DNA sequences which but for the 
degeneracy of the genetic code would hybridise to and 
code for a polypeptide having the same amino acid 
sequence as any of the previous DNA sequences. Most 
preferably, the DNA sequences according to this 
embodiment are those that encode amino acids 24-303 or 
amino acids 29-303 of SEQ ID NO: 2. 

In this embodiment, the isolated DNA sequence 
must encode a catalytically active CMH-1. All DNA 
sequences which encode the same amino acids as 
nucleotides 117-959 of SEQ ID NO:l (amino a^cids 24-303 of 
SEQ ID NO: 2, or nucleotides 132-959 of SEQ ID NO:l (amino 
acids 29-303 of SEQ id NO: 2, are clearly within the scope 
of this invention. Such genetically degenerate sequences 
can be deduced by those of ordinary skill in the art 
without the exercise of inventive skill utilizing the 
well-known genetic code. 

Additional DNA sequences which encode active 
CMH-1 polypeptides may be identified by standard DNA 
library screening techniques using nucleotides 117-959 or 
132-959 of SEQ id NO:l or portions thereof as a probe. 
-SuGh-homologous-DNA sequences should not contain 
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nucleotides which alter the encoded catalytic residues 
Hisl44 or Cysl86 of SEQ ID NO: 2. Preferably, such 
homologous DNA sequences will also not contain 
nucleotides which alter the PI (Arg87, Glnl84, Arg233, 
5 Ser239) site amino acids of SEQ ID N0:2. More preferred 
are homologues which do not contain nucleotides that 
alter the P2 (Tyr 229, Trp232), P3 (Asn 180, Pro 235) and 
P4 (Ser234, Arg237, Trp240, Ser275) site amino acids of 
SEQ ID NO:2. Even more preferred are homologous DNA 
10 sequences which contain nucleotides encoding only 

conservative amino acid substitutions for some or all of 
the other amino acids of SEQ ID NO: 2 * The translation 
products of any of these DNA sequences may then be 
assayed for cysteine protease activity as described 
15 below. 

According to another embodiment, the invention 
provides an isolated DNA sequence encoding an inactive 
CMH-1 polypeptide selected from DNA sequences which 
hybridize to and are at least 80%, homologous to SEQ ID 

20 NO:l and which encode an active site mutation at either 
amino acid 144 or 186 of SEQ ID N0:2; and DNA sequences 
which but for the degeneracy of the genetic code would 
hybridize to and code for a polypeptide having the same 
amino acid sequence as the former DNA sequences. More 

25 preferred are DNA sequences which encode mutations at 
both amino acid 144 and 186 of SEQ ID N0:1. Even more 
preferred are DNA sequences which additionally encode 
mutations at one or more of the following amino acids of 
SEQ ID NO:2: 87, 184, 233, 239, 229, 232, 180, 235, 234, 

30 237, 240 and 275. Most preferred is a DNA sequence which 
encodes the following mutations in SEQ ID NO: 2: 
Cysl86-Ser, Trp232-Ala and Arg233-Glu. 

As set forth above, DNA sequences according to 
this aspect of the invention may be identified and 

35 isolated through standard cDNA library screening. 
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Specific mutations may be introduced into Cmh-1 DNA 
through the use of site-directed mutagenesis. The 
translation products of such DNA sequences may then be 
assayed for (lack of) cysteine protease activity as 
described below. 

According to another embodiment of this 
invention, any of the DNA sequences described above may 
be employed in a recombinant DNA molecule. The isolated 
DNA sequences of this invention may be inserted into any 
of the numerous commercially or otherwise" publicly 
available cloning vectors. Useful expression vectors, 
for example, may consist of segments of chromosomal, non- 
chromosomal and synthetic DNA sequences. Suitable 
vectors include derivatives of SV40 and known bacterial 
plasmids, e.g., E.colj plasmids col El, pCRl, pBR322, 
PMB9 and their derivatives, plasmids such as RP4; phage 
DNAs, e.g., the numerous derivatives of phage A, e.g., 
NM989, and other phage DNA, e.g., Ml 3 and Fi lamenl-ons 
single stranded phage DNA; yeast plasmids such as the 2p 
plasmid or derivatives thereof; vectors useful in 
eukaryotic cells, such as vectors useful in insect or 
mammalian cells; vectors derived from combinations of 
plasmids and phage DNAs, such as plasmids that have been 
modified to employ phage DNA or other expression control 
sequences; and the like. 

The choice of cloning vector will, of course, 
depend upon the cell type(s) to be transformed with the 
resulting recombinant DNA molecule, as well as. the 
ultimate use of that recombinant DNA molecules (i.e., DNA 
production, gene therapy, polypeptide expression) . 
Preferred eukaryotic vectors are SV40-derived vectors. 
Preferred prokaryotic vectors are E. coli expression 
vectors. Preferred viral vectors are modified eukaryotic 
viral vectors, preferably attenuated adenovirus vectors, 
J^t_may-.be-.used i-n gene -therapy, such as those described 
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in PCT publications WO 94/26915 and WO 94/28938, the 
disclosures of which are herein incorporated by 
reference. 

Techniques for inserting a DNA sequence of this 
5 invention into a vector to produce a recombinant DNA 

molecule involve standard molecular biological techniques 
and are well known in the art (see, for example, J, 
Sambrook, et al., Molecular Cloning - A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 

10 New York (1989) ) . 

According to a preferred embodiment, the 
recombinant DNA molecule of this invention will 
additionally comprise an expression control sequence 
operatively linked to a DNA sequence of this invention* 

15 The term "expression control sequence" refe rs* - ..to a DNA 
sequence that controls and regulates the transcription 
and translation of another DNA sequence- It includes 
both 5' and 3' non-coding DNA sequencing and optionally 
includes an ATG start codon. 

20 The term "operatively linked" refers to the 

positioning of an expression control sequence with 
respect to a coding DNA sequence of interest such that 
the expression control sequence controls and regulates 
the transcription and translation of that DNA sequence. 

25 The term "operatively linked" includes having an 

appropriate start signal (e.g., ATG) in front of the DNA 
sequence to be expressed and maintaining the correct 
reading frame to permit expression of the DNA sequence 
under the control of the expression control sequence and 

30 production of the desired product encoded by the DNA 

sequence. If a gene that one desires to insert into a 
recombinant DNA molecule does not contain an appropriate 
start signal, such a start signal can be inserted in 
front of the gene. 

35 The choice of expression control sequence* 
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depends upon the nature of the recombinant DNA molecule, 
the host that will be transformed by that recombinant DNA 
molecule, and whether constitutive or inducible 
expression of a DNA sequence of this invention is 
desired. Such useful expression control sequences 
include, for example, the early and late promoters of 
SV40 or adenovirus, the lac system, the £ee system, the 
X&£ or 1R£ system, the major operator and promoter 
regions of phage A, the control regions of fd coat 
protein, the promoter for 3-phosphoglycerate kinase or 
other glycolytic enzymes, the promoters of acid 
phosphatase (e.g., Pho5) , the promoters of the yeast a- 
mating factors, and other sequences known to control the 
expression of genes of prokaryotic or eukaryotic cells or 
their viruses, and various combinations thereof. 

These recombinant DNA molecules arenased to 
produce CMH-1 polypeptides. Preferred expression control 
sequences are the SRa promoter, when a eukaryotic vector 
is employed, and any inducible promoter when a 
prokaryotic vector is employed. Most preferred are the 
expression control sequences contained in the 
commercially available pET-15b £. coli vector (Novagen) . 

The invention also provides host cells 
transformed by the recombinant DNA molecules^ of this 
invention. These hosts may include well known eukaryotic 
and prokaryotic hosts, such as strains of E.coli . 
Pseudomonfl? , Bacillus, Streptomvcps - fungi such as 
yeasts, and animal cells, such as CHO, Rl.l, B-W and L-M 
cells, African Green Monkey kidney cells (e.g., COS 1, 
COS 7, BSC1, BSC40, and BMT10) , insect cells (e.g., Sf 9) , 
and human cells and plant cells in tissue culture. 
Eukaryotic cells may harbor the recombinant DNA molecules 
of this invention as an extrachromosomal element or 
incorporate all or part of it into the host chromosome. 
It will be understood that not^ll.vectors, 
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expression control sequences and hosts will function 
equally well to replicate and/or express the DNA 
sequences of this invention. Neither will all hosts 
function equally well with the same expression system. 
5 However, one skilled in the art will be able to select 
the proper vectors, expression control sequences, and 
hosts without undue experimentation to accomplish the 
desired expression without departing from the scope of 
this invention. For example, in selecting a vector, the 

10 host must be considered because the vector must function 
in it. The vector's copy number, the ability to control 
that copy number, and the expression of any other 
proteins encoded by the vector, such as antibiotic and 
other selective markers, will also be considered. 

15 In selecting an expression control sequence, a 

variety of factors will normally be considered. ~ These 
include, for example, the relative strength of the 
system, its controllability, and its compatibility with 
the particular DNA sequence or gene to be expressed, 

20 particularly as regards potential secondary structures. 
Suitable unicellular hosts will be selected by 
consideration of, e.g., their compatibility with the 
chosen vector, their ability to fold proteins correctly, 
and their fermentation requirements, as well as the 

25 toxicity to the host of the product encoded by the DNA 
sequences to be expressed, and the ease of purification 
of the expression products. 

Considering these and other f actors a person 
skilled in the art will be able to construct a variety of 

30 vector/expression control sequence/host combinations that 
will replicate and/or express the DNA sequences of this 
invention on fermentation or in large scale animal 
culture. 

Methods for transforming cells with recombinant 
35 DNA molecules are well known in the art (see,- for 
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example, J. Sambrook, et al. f Molecular rim,< P1| ^ 
Laboratory Mam i nl , Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York (1989)). Any of those 
methods may be employed to produce the transformed hosts 
of this invention. Identification of transformed hosts 
may be achieved by assaying for the presence of Cmh-1 
DNA, cmh-1 RNA or a CMH-1 polypeptide. Additionally, 
transformants may be identified by growth in selective 
media. For this assay, the gene necessary for growth in 
selective media is cotransfected into the- cell either on 
the same or a different recombinant DNA molecule as the 
Cmh-1 DNA and is expressible in that cell. It will, of 
course, be obvious that the gene for selective growth 
should not present in the untransf ormed cell. 

The transformed hosts of this invention may be 
employed to produce either large quantities of- Cmh-1 DNA 
sequences of this invention and/or a CMH-1 polypeptide 
encoded thereby. In order to produce large quantities of 
Cmh-1 DNA, the host, preferably a prokaryotic host, is 
grown in a medium and under conditions that promote DNA 
replication and cell division. Any complete media 
routinely used to grow bacteria is suitable for this 
purpose. Following growth, the transformed cells are 
separated from the growth medium and plasmicf DNA is then 
isolated from the cells by standard and well-known 
techniques. Cmh-1 DNA may then be excised from the 
plasmid through the use of restriction endonucleases . 

When the transformed hosts of this are employed 
to produce a CMH-1 polypeptide, it is preferred that the 
CMH-1 polypeptide be active or activatable. In this 
embodiment, the preferred host is a mammalian cell. The 
transformed host should be grown in a medium that 
promotes expression of the CMH-1 polypeptide-encoding DNA 
sequence present in that host. If expression of that DNA 
sequence is under the control of a constitutive promoter-. 
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any standard growth medium is suitable. If the Croh-1 DNA 
is under the control of an inducible promoter, the growth 
medium should be supplemented with a compound that 
induces expression or growth conditions should be altered 
5 so as to induce expression (i.e., change in growth 

temperature) . Following expression, the transformed 
cells are separated from the growth medium, lysed and the 
CMH-1 polypeptide purified by standard methods. 

In order to determine the presence of an active 
10 CMH-1 polypeptide, a protease activity assay is used. 

Such an assay measures the ability of the polypeptide to 
cleave natural of synthetic substrates. Active CMH-1 
polypeptides cleave Asp-X bonds. The preferred natural 
substrate for these assays is poly (ADP-ribose) polymerase 
15 ("PARP"). Synthetic substrates useful in these assays 

are peptides containing an~Asp-X bond. A preferred assay 
for detecting active CMH-1 polypeptide is described in 
Example 4 . 

In addition to the CMH-1 polypeptides, the 

20 invention also provides antibodies to CMH-1 polypeptides. 
The generation of antibodies may be achieved by standard 
methods in the art for producing both polyclonal and 
monoclonal antibodies using a CMH-1 polypeptide or 
fragment thereof as antigen. Such antibodies or active 

25 fragments thereof (such as Fab fragments), may be used to 
assay in vivo or in vitro levels of CMH-1 polypeptide. 
Increased in vivo levels of CMH-1 polypeptides may be 
indicative of the onset of apoptosis. 

Antibodies to CMH-1 polypeptides also have 

30 potential therapeutic use in preventing apoptosis in a 

patient. Alternatively, the CMH-1 antibodies may be used 
to regulate cholesterol levels in a patient. As stated 
previously, the cysteine proteases involved in apoptosis 
also activate sterol responsive element binding proteins, 

35 which in turn regulate transcription of genes- encoding 
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the LDL receptor and 3-hydroxy-3-methylglutaryl CoA 
Synthase (X. Wang et al., (1995)). 

Such antibodies may be directed to their target 
cells through conjugation to cell-specific molecules. 
The conjugation of such antibodies and the identity of 
cell-specific targeting molecules is known in the art. 

Finally, affl-1 polypeptide antibodies may be 
employed to purify CMH-1 polypeptides from either native 
sources or transformed hosts expressing CMH-1 
polypeptides. For example, the antibodies can be 
conjugated by standard techniques to an insoluble matrix 
to form an immunoaf f inity resin. 

The Cmh-1 DNA sequences of this invention, of 
fragments thereof, may be used to identify homologous DNA 
sequences so as to potentially identify other members of 
this cysteine protease family. These DNA sequences, or 
fragments thereof, The DNA sequences of this invention 
are useful in a number of therapeutic and diagnostic 
applications . 

According to one embodiment, the DNA sequences 
encoding an active or activatable CMH-1 polypeptide may 
be used in either in vitro or in vivo gene therapy. In 
this embodiment, the DNA must be contained in a suitable 
vehicle for gene therapy. Such vehicles are- known in the 
art. For example, various viruses that are capable of 
transferring genetic material to a target cell, without 
replicating in that cell have been described (United 
States patents 5, 112,767, 5,240,846 and 5,112,7.67, the 
disclosures of which are herein incorporated by 
reference) . Such viruses include replication-defective 
adenoviruses, adeno-associated viruses (AAV), and 
replication defective retroviruses, such as PLJ, pZip, 
pWe and pEM. 

Gene delivery systems other than viruses can 
also be employed in the methods of this invention. For • 
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example, the gene to be transferred may be packaged in a 
liposome. When cells are incubated with DNA-encapsidated 
liposomes, they take up the DNA and express it. To form 
these liposomes , one mixes the DNA of an expression 
5 vector which expresses the gene to be transferred with 
lipid, such as N- [ 1- (2, 3, dioleyloxy) propyl ] -N, N, N-t ri- 
me thy 1 ammonium- Ghl or ide (D0TMA) in a suitable buffer, 
such as Hepes buffered saline. This causes the 
spontaneous formation of lipid-DNA complexes (liposomes) 

10 which can be employed in the methods of this invention 
[P. L. Feigner et al . , Proc. Natl. Acad. Sci. USA , 84, 
pp. 7413-17 (1987) ] . 

Another gene delivery system that may be 
utilized in this invention is DNA-protein complexes. The 

15 -^rfermati?©^ of these complexes is described in United 
States patent 5,166,320, the disclosure of which is 
herein incorporated by reference. Specifically, these 
complexes comprise the gene to be transferred (together 
with promoter, enhancer sequences and other DNA necessary 

20 for expression in the target cell) linked via a suitable 
polymer, such as polylysine, to a polypeptide ligand for 
a receptor or other cell surface protein. This complex 
is taken up by the target cells via endocytosis after the 
ligand binds to the cell surface receptor. The DNA is 

25 then cleaved from the rest of the complex via 

intracellular enzymes which cut the polymer linker. Such 
complexes are particularly useful in targeting specific 
cell types. For example, cancer cells may be 
specifically targeted through the use of proteins which 

30 specifically recognize tumor cell surface markers or 
receptors . 

The preferred use according to this embodiment 
is in the treatment of cancer in a patient. The term 
"patient", as used herein refers to any mammal, 
35 especially humans. Cancer cells transformed with a DNA ' 
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sequence encoding an active or activatable CMH-1 
polypeptide will express that DNA, which in turn, will 
cause cell death. 

DNA sequences encoding inactive CMH-1 
5 polypeptides are also useful in gene therapy to inhibit 
apoptosis. According to this embodiment, the inactive 
CMH-1 polypeptide encoded by this DNA would competitively 
compete for substrate with native, active CMH-1. This 
would decrease the CMH-1 activity in the cell, inhibiting 
10 apoptosis. Preferred cells to be treated by this method 
are nerve cells. 

The DNA sequences of this invention may also be 
used as anti-sense inhibitors of CMH-1 polypeptide mRNA 
and thus inhibitors of apoptosis. In this embodiment, 
15 single stranded, anti-sense Cmh-1 DNA is introduced into 
cells by standard techniques. Once inside the cell, the 
DNA binds to Cmh-1 mRNA and prevents it from being 
translated. 

According to an alternate embodiment, the DNA 
sequences of this invention may be used as a diagnostic 
tool to determine Cmh-1 mRNA levels in cells. This may 
be achieved by standard Northern blot or dot blotting 
techniques that are well known in the art. For example, 
a low level of Cmh-1 mRNA in cancer cells ii" indicative 
that those cells will respond to chemotherapy. in normal 
cells, high levels of Cmh-1 mRNA may be predictive of 
disease, especially in nerve and other neurological 
cells . 

The active or activatable CMH-1 polypeptides of 
this invention are useful assay reagents for identifying 
and characterizing potential apoptosis inhibitors. In 
this embodiment, potential inhibitors are added to an 
assay mixture containing an active or activatable CMH-1 
polypeptide and a substrate. If an activatable form is 
used, the assay must employ conditions which convert that 
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form to an active form. Preferably, an active form of 
CMH-1 polypeptide is used. Inhibition of a CMH-1 
polypeptide's ability to cleave its substrate by a 
compound suggest that that compound is a potential 
5 inhibitor of apoptosis. 

In order that the invention described herein 
may be more fully understood, the following examples are 
set forth. It should be understood that these examples 
are for illustrative purposes only and are not to be 
10 construed as limiting this invention in any manner. 

EXAMPLE 1 
Tinning nf CMH-1 cDNA 
We initially screened the GenBank EST database 
maintained by the National Center for Biotechnology 
15 Information -fNCB-I) Sequence for cDNA clones encoding ICE- 
like polypeptides using the Blast algorithm [S. F. 
Altshul et al., J. Mol. Biol, , pp. 403-10 (1990)]. We 
generated initial multiple sequence alignments of known 
ICE family protein peptide sequences using the MACAW 
20 program (G. D. Schuler et al., Proteins : Structure, 

Function, and Genetics , 9, pp. 180-90 (1991)] and later 
refined by the combination of the PILEUP algorithm 
(Genetics Computer Group, 1995) and pairwirse comparison. 

Through this screen, we identified a partial 
25 human cDNA clone deposited by the Washington 

University-Merck EST project (accession number T50828) 
which had a partial open reading frame with limited 
similarity to our consensus ICE sequence and a 3' 
untranslated region. We obtained the partial human cDNA 
30 clone 72778 corresponding to this EST (expressed sequence 
tag) sequence through the IMAGE Consortium (Lawrence 
Livermore National Laboratory) . 

In order to generate a full-length cDNA clone, 
we ligated a spleen cDNA pool with a DNA adaptor 
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(Clontech, Palo Alto, CA) and subsequently performed PCR 
using a primer to the 5' adaptor (SEQ ID NO: 3: 
5' -CCATCCTAATACGACTCACTATAGGGC and a primer to the EST 
T50828 sequence SEQ ID N0:4: (5 f -GCAAACTCTGTCAATTCACCC) . 
Through this technique, we were able to generate a 0,8 kb 
SacI-HgiAI PCR fragment containing the 5 f coding region. 
We then ligated this fragment to the 1.55 kb HgiAI-Kpnl 
fragment from the 72778 clone containing the 3 ? region 
subcloned the resulting DNA into pBluescript SK 
(Stratagene, La Jolla CA) at Sacl/Kpnl to- generate the 
full length cDNA. 

We sequenced the resulting cDNA, named Cmh-1 
(£PP32/Mch2 homoiog-1), on both strands by ABI Prism Dye 
Deoxy Terminator sequencing using the ABI 373 DNA 
Sequencer. The CMH-1 sequence is set forth in the 
-Sequence Listing as SEQ ID NO:l. 

Analysis of the full length cDNA sequence 
identified a single open reading frame of 303 amino acids 
encoding a protein of 34 kDa, which we have named CMH-1 
The encoded protein is set forth in the Sequence Listing 
as SEQ ID NO: 2. We identified the catalytic residues as 
Hisl44 and Cysl86 in SEQ ID NO:2. 

CMH-1 was found to have 52% identity to CPP32 
identified by T. Fernandes-Alnemri et al. (£994) . In 
addition, there was sequence similarity with all members 
of the ICE/Ced3 gene family. Overall, the polypeptide 
can be divided into three domains, roughly corresponding 
to the P15, P20, and P10 subdomains of ICE. In CMH-1 
these domains are 2, 21, and 11 kD, respectively. 

There is strong similarity between CMH-1 and 
the sequence that comprises the core of the ICE 
structure, as well as a number of residues surrounding 
the active site Cys265 and His237 in the ICE crystal 
structure [K. P. Wilson et al., Nature , (1994); N. P. c. 
Walker et al.. Cell , (1994)]. Superposition of the CMH-i 
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sequence onto this ICE structure allowed the 
identification of a number of other residues located 
around the protease active site (Fig. 1) . 

Interestingly, all of the residues which line 
5 the substrate binding pocket at the PI position are 

identical with those found in ICE, including Arg87 and 

Arg-233,- corresponding to ICE Argl79 and Arg341. This 

strongly suggests that CMH-1 has substrate specificity 
for Asp in this position, as seen in other members of 
10 this family. 

There is much less sequence conservation in the 
binding pockets corresponding to the P3-P4 residues 
between CMH-1 and its closest homolog CPP32, suggesting 
that CMH-1 has a different substrate (s) . 

15 

EXAMPLE 2 

Ex pression And Processing o f CMH-1 in COS cells 

We used transient expression of CMH-1 in COS 
cells to determine if CMH-1 has the predicted cysteine 

20 protease activity. Removal of the pro-domain from the 
CPP32 precursor occurs between Asp28 and Ser29 ID. W. 
Nicholson et al., Ma£llE£, (1995)]. For CMH-1, we 
estimated the position at which the prodomain removal 
might occur as Asp23-Ala24. 

25 We therefore produced a cDNA fragment encoding 

an N-terminally T7-tagged CMH-1 lacking the first 23 
amino acid residues by PCR using the following primers: 
SEQ ID NO: 5: 5 ' -GCTGCAGTCTAGAGCTCCATGGCTAGCATGACTGGTG 
GACAGCAAATGGGTGCTAAGCCAGACCGGTCCTCG-3 ' and SEQ ID NO: 6: 

30 5 1 -GAATTCTAGATTCTATTGACTGAAGTAGAGTTC-3 1 . We digested the 
resulting amplified DNA with Xbal and subcloned it into 
an Xbal-cut SV40-based expression vector under the 
control of SRa promoter (pcDLSRa) I Y . Takebe et al.,Jttal*. 
Cftll. Biol. , 8, pp. 466-72 (1988)]. 
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We produced a corresponding inactive mutant 
CMH-1 containing a Cysl86-Ser, a Trp232-+Ala and a 
Arg233-»Glu mutation (C186S/W232A/R233E) by site-directed 
mutagenesis using oligonucleotides SEQ ID NO: 7: 
5 * -CATTCAGGCTAGCCGAGGGAC and SEQ ID NO: 8: 
5 ' -GCTATTACTCAGCTGAGAGCCCAGGA and following well-known 
techniques [T. A. Kunkel, Pror. Nat~ 1 A cari. Sri 82 , 
PP. 488-92 (1985); Y. Gu et al., EMBO J. . (1995), the 
disclosures of which are herein incorporated by 
reference]. These substitutions were at the conserved 
residues for catalysis and PI aspartate binding. 

We transiently transfected COS cells using the 
DEAE-dextran method as described by Y. Gu et al., EMBO 
Hut (1995). COS cells were grown in 6 well culture 
plates and transfected with 3pg of plasmid DNA containing 
wither the active or inactive CMH-1 coding sequences. We 
harvested the cells 24 hours after transf ection . 

Transfection of the expression plasmid in COS 
cells results in high expression and processing of the 
truncated CMH-1 protein. Immunoblotting of the COS cell 
lysates with an anti-T7 antibody detected the unprocessed 
precursor with an apparent molecular weight of 34 kDa and 
a processed product with an apparent molecular weight of 
22 kDa (Fig. 2, panel A) . ~ - 

When we co-expressed T7-CMH-1 with a 43 kDa 
truncated form of T7-tagged PARP, the PARP was cleaved 
into a T7-tagged 31-kDa fragment, similar to the cleavage 
of PARP by CPP32, ICE, Tx, and Nedd-2 [Y. Gu et al., ^ 
BAOl, Chem ., 270, pp. 18715-18 (1995)]. The inactive 
mutant (C186S/W232A/R233E) was defective in both 
autoprocessing and PARP cleavage. 

Similar to other ICE-like cysteine proteases, 
we found that overexpression of the CMH-1 protein induces 
cell apoptosis, as demonstrated by the presence of cell 
shrinkage and DNA fragmentation in the transfected cells" 
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(Fig, 2, panel B) . It was apparent that induction of 
apoptosis was dependent on the presence of cysteine 
protease activity because expression of the active site 
mutant did not result in apoptosis of transfected COS 
5 cells (Fig. 2, panel B) . 

EXAMPLE 3 

Expression and Purificat ion of 
Recombinant CMH-1 In E. coli 

To further characterize the activity of CMH-1 

10 protein, we expressed an N-terminal (His ) 6 -tagged 

CMH-1 (Ala24-Gln303) in E. coli. We constructed an 

expression plasmid for the N-terminal (His) ^-tagged 

CMH-1 (Ala24-Gln303) by introducing in-frame Xhol sites to 

5' and 3' ends of Cmh-1 cDNA using PCR with the following 

15 primers: 

SEQ ID NO: 9: 5' CGGCTCGAGGCTAAGCCAGACCGGTCCTCG and 
SEQ ID NO: 10: 5' GGGCTCGAGCTATTGACTGAAGTAGAGTTC • The 
resulting Xhol fragment was ligated into an Xhol-cut 
pET-15b inducible E. coli expression vector (Novagen, 

20 Madison, WI) . The resulting plasmid encodes a 

polypeptide of 303 amino acids with the 23-residue 
peptide SEQ ID NO: 11: (MGS S HHJJHHHS S GLVPR£SHMLE , where 
LVPRGS (amino acids 14-19 of SEQ ID NO: 11) ^represents a 
thrombin cleavage site fused in frame to the N-terminus 

25 of CMH-1 starting at Ala24, as confirmed by DNA 

sequencing as well as by N-terminal sequencing of the 
expressed proteins. 

E. coli strain BL21(DE3) carrying this plasmid 
was induced with 0.8 mM IPTG, harvested and lysed in a 

30 microf luidizor in a buffer containing 20 mM sodium 

phosphate (pH 6.8), 300 mM NaCl, 2 mM DTT, 10% glycerol, 
0.4 mM PMSF, 2.5 ixq/ml leupeptin. We cleared the lysates 
by centrifugation at 100,000 x g for 30 minutes. 

The supernatant containing soluble CMH-1 were 
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loaded onto a 0.5-ml nickel-NTA column (Qiagen) and 
washed with the same buffer- The column was then eluted 
with increasing concentrations of imidazole in the same 
buffer, and CMH-1 protein was eluted at 100-200 mM to 
give a yield of approximately 5 mg of protein per liter 
of culture. Imidazole was then removed by dialysis. 

The purified protein fraction contained, three major 
polypeptides of approximately 34 kDa, 22 kDa, and 12 kDa 
(Figure 3) . An 11 kDa polypeptide was also present in 
some preparations. 

Immunoblotting of the £. coJi expressed 
(His) 6 -tagged CMH-1 protein with an anti- (His) 6 -tag 
antibody were carried out as described by Gu et al., J. 
94-01 1 Cfrfim. (1995). The N-terminus of each of these 
peptides was analyzed using the ABI 477A Protein 
Sequencer (Applied Biosystems, Inc., Foster City, CA) . 

These studies indicated that the 34 kDa and 22 
kDa polypeptides contain the (His) 6 -tag and the 
predicated N-terminal sequence of the truncated CMH-1 
starting at Ala24. The N-terminus of 12 kDa and 11 kDa 
polypeptides start at Serl99 and Ala207 / respectively, 
both of which are preceded by an Asp residue, suggesting 
that autoprocessing of the CMH-1 (Ala24-Gln303) protein 
occurs predominantly at the Aspl98-Serl99 site and to a 
lesser extent at the Asp206-Ala207 site- The 22 kDa and 
12 kDa proteins are the autoprocessing products of the 
full length 34 kDa protein. They represent the two 
subunits of the CMH-1 protease. 

EXAMPLE 4 
In vitro cleavaaP of PARP bv CMH-1 
In order to determine the activity of purified 
CMH-1, we determined whether the protein could cleave 
2b [S) -labeled pro-IL-lb and PARP and compared its cleavage 
activity to purified ICE. We prepared 35 [ S ] -labeled PARP 
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10 



and IL-lb precursor (pro-IL-lb) by in vitro transcription 
and translation. Cleavage experiments were carried out 
as described previously [Y. Gu et al., ,7 , Rio), Chem. 
(1995)] . Cleavage products were analyzed by SDS-PAGE and 
fluorography as described in that reference. 

The results demonstrated that CMH-1 cleaved 
PARP but not pro-IL-lb. Less than 3 nM CMH-1 was 
required to completely process PARP within 30 minutes 
(Fig. 4, panel A). This efficiency was significantly 
higher than that of ICE in cleaving the same substrate 
under identical conditions, and was similar to the 
activity of CPP32 against PARP (results not shown) or ICE 
against pro-IL-lb IY. Gu et al . , ,T Biol, Chem. , (1995)). 

The cleavage activity of CMH-1 was inhibited by 
15 the tetrapeptide inhibitor DEVD-CHO [D. W. Nicholson et 
al.. Nature , (1995); X. Wang et al., ,T , Plol. Chgm. 
(1995)), as well as by the cysteine-alkylating reagents 
N-ethylmaleimide and iodoacetamide, but not by the ICE 
inhibitors YVAD-CHO [N. A. Thornberry et al., Nature , 

. i , n ;„i rhpm (1995)1 or crmA (a 
20 (1993); Y. Gu et al . , J, PiQl. inem, . Ki.z>?~>i i 

viral protein that is a specific ICE inhibitor) or 

inhibitors of serine proteases or metalloproteases (Fig. 

4, panel B) . These results demonstrate that CMH-1 is a 

cysteine protease with properties closer to CPP32 than to 

25 ICE. This was consistent with the relative sequence 

identity between CPP32 and CMH-1. 
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f.Y AMPLE 5 

Northern Hot analysis of CMH-J. mRNA 
We investigated the expression of Cmh-1 mRNA in 
adult HUMAN tissues was investigated by the Northern 
blotting using a Cmh-1 specific cDNA probe. A human 
multiple tissue RNA blot (MTN blot, Clontfech) containing 
2/zg/lane of poly (A)' RNA was hybridized to the 
"[P] -labeled probe of full length Cmh-1 cDNA in 50% 
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formamide, lOx Denhardts, 2% SDS, 5x SSPE and 100 /zg/ml 
sheared and denatured ssDNA at 42°C for 16hrs. The blot 
was washed in 2x SSC, 0.1% SDS at room temperature for 40 
minutes and at o.lx SSC, 0.1% SDS at 42°C for 20 minutes 
and exposed to Kodak XAR-5 film for 18 hours. 

A 2.5 kb mRNA was detected in tissues including 
pancreas, lung, placenta, kidney, muscle and heart 
Notably, the expression of Cmh-1 was low in liver and 
almost undetectable in brain (Figure 5, . This ubiquitous 
expression of Cmh-l in a variety of tissues is similar to 
that of CPP32 (data not shown) . 

While I have hereinbefore presented a number of 
embodiments of this invention, it is apparent that my 
basic construction can be altered to provide other 
embodiments which utilize the methods of this invention 
Therefore, it will be appreciated that the scope of this 
invention is to be defined by the claims appended hereto 
rather than the specific embodiments which have been 
presented hereinbefore by way of example. 
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SEQDENCE LISTING 

(1) GENERAL INFORMATION: 

(i) ^A^NAMEi Vertex Pharmaceuticals Incorporated 

(B) STREET: 130 Waverly Street 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: United States of America 
<F) POSTAL CODE (ZIP) : 02139-4242 

(G) TELEPHONE : 617-577-6000 

(H) TELEFAX: 617-577-6680 

(ii) TITLE OF INVENTION : DNA ENCODING CMH-1 - A CYSTEINE PROTEASE 
INVOLVED IN APOPTOSIS 

(iii) NUMBER OF SEQUENCES: 11 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/007,211 

(B) FILING DATE : 03-NOV-1995 

(vi) PRIOR APPLICATION DATA: rrt/An - 

(A) APPLICATION NUMBER: US 60/007,251 

(B) FILING DATE : 06-NOV-1995 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/558,733 

(B) FILING DATE : 16-NOV-1995 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE : 

(A) NAME /KEY : CDS 



$ SmSic^»> "-"on- "cysteine prote.se' 
/product= "CMH-1" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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CTAATGCGGA GAGAGACTGT GCCAGTCCCA GCCGCCCTAC CGCCGTGGGA ACG ATG 



Met 
1 



56 



10 15 

25 30 

£ 2 E E £ £ £ £ E 5 £ K 2; s S £ ~ 
£ £ 2! £ 2; £ £ £ ?yr a £ £ £ £ E £ - 

55 60 65 

£ E £ £ £ £ £ £ £ £ f" r T «* *» « »« 

7o ys Phe L y s Val 

75 80 

£ £ £ £ £ £ £ £ S £ £ £ £-£ S £ 

5 90 95 

He H£ pJ° ^ A AGC CTG GGT TTT GAC GTG ATT GTC TAT AAT GAC 

Lys Cys Phe Arg Ser Leu Gly P he Asp Val He Val Tyr £n 2p 

105 11Q 

Ser lyl £a L^s Met rf ? T CTG ° TT *** ™ GCT TCT G AA GAG 
ill V 3 LyS Met Leu Leu Lys Ala Ser Glu Glu 

120 125 

§ £ £ £ E E £ E S £ £ £ £ £ £ % - 

135 "0 14 | 

E E £ £ £ £ - - S - « « « £ j. . 536 

150 155 160 

£ £ £ £ £ £ £ £ £ £ - £ £ s £ £ - 

£ £ £ E £ £ £ s £ £ £ £ £ £ £ £ « 

185 190 

£ £ £ E £ £ £ £ £ £ £ £ £ E - « - 



344 



392 



440 



200 205 

CGA TAC AAG ATC CCA GTG GAA GCT GAC TTC CTC TTC GCC TAT TCP arr 
Arg Tyr Lys He Pro Val Glu Ala Asp Phe Leu "e £a Tyl Ser T*hr 

215 220 225 

vll Pro* I M l hC l CG TGG AGG AGC CCA GGA AGA TCC TGG TTT 

Val Pro Gly Tyr Tyr Ser Trp Arg Ser Pro Gly Arg Gly Ser Trp Phe 



728 



776 
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235 



240 



824 



872 



920 



962 



S SB S S 3S S S = i S SS G G S » g S « 

245 250 

S S 5 2 £ 5 s E s s s s s SS IS 

260 265 

S SS S< 5 5 S S 2 2 2 2 i - 222 

275 280 

290 295 
TAGCCATATC AGGGGTACAT TCTAGCTGAG AAGCAATGGG TCACTCATTA ATGAATCACA 1022 
TTTTTTTATG CTCTTGAAAT ATTCAGAAAT TCTCCAGGAT TTTAATTTCA GGAAAATGTA 1062 
TTGATTCAAC AGGGAAGAAA CTTTCTGGTG CTGTCTTTTG TTCTCTGAAT TTTCAGAGAC 1142 
TTTTTTTATA ATGTTATTCA TTTGGTGACT GTGTAACTTT CTCTTAAGAX. TAATTTTCTC 1202 
TTTGTATGTC TGTIACCITG TTAATAGACT TAATACATGC AACAGAAGTG ACTTCTGGAG 1262 
AAAGCTCATG GCTGTGTCCA CTGCAATTGG TGGTAACAGT GGTAGAGTCA TGTTTGCACT 1322 
TGGCAAAAAG AATCCCAATG TTTGACAAAA CACAGCCAAG GGGATATTTA CTGCTCTTTA US2 
TTGCAGAATG TGGGTATTGA GTGTGATTTG AATGATTTTT CATTGGCTTA GGGCAGATTT 14,2 
TCATGCAAAA GTTCTCATAT GAGTTAGAGG AGAAAAAGCT TAATGATTCT GATATGTATC 1S02 
CATCAGGATC CAGTCTGGAA AACAGAAACC ATTCTAGGTG TTTCAACAGA GGGAGTTTAA 1S62 
TACAGGAAAT TGACTTACAI AGATGATAAA AGAGAAGCCA AACAGCAAGA AGCTGTTACC 1622 
ACACCCAGGG CTATGAGGAT AATGGGAAGA GGTTTGGTTT CCTGTGTCCA GTAGTGGGAT 1682 
CATCCAGAGG AGCTGGAACC ATGGTGGGGG CTGCCTAGTG GGAGTTAGGA CCACCAATGG 1742 
ATTGTGGAAA ATGGAGCCAT GACAAGAACA AAACCACTGA CTGAGATGGA GTGAGCTGAG 1.02 
ACAGATAAGA GAATACCTTG GTCTCACCTA TCCTGCCCTC ACATCTTCCA CCAGCACCTT 1862 

„„, -rrnrcta. CCAAGGACCT TGGAAGAGCA AGGGACAGTG 1922 

ACTGCCCAGG CCTATCTGGA AGCCACCTCA tLhA^ 

arrTCCCCCA TAATGTGAAC ATAAGTAATC 1982 
AGGCAGGAGA AGAACAAGAA ATGGATGIAA GCCTGGCCCA TAAK.1 

ACTAATGCTC AACAATTTAT CCAT.CAArC ATTTATTCAT TGGGTTGTCA GATAGTCTAT 2042 
GTATGTGTAA AACAATCTGT TTTGGCTTTA TGTGCAAAAT GTGTTATAGC TTTAAAATAT 2102 
ATCTGGAACT TTTTAGATTA TTCCAAGCCT TATTTTGAGT AAATATTTGT TACTTTTAGT 2162 
TCTATAAGTG AGGAAGAGTT TATGGCAAAG ATTTTTGGGC ACTTTGTTTT CAAGATGGTG 2222 
TTATCTTTTG AATTCTTGAT AAATGACTGT TTTTTTCTGC CTAATAGTAA CTGGTTAAAA 2282 
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AACAAATGTT CATATTTATT GATTAAAAAT GTGGTTGCTT AAAAAAAAAA AAAAAAAAA 2M 1 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
Met Ma Asp Asp Gin Gly Cys He Glu <a„ Gln Gly Val filu ^ 

Ala Asn Glu Asp Ser Val Asp Ala Lys Pro Asp Arg Ser Ser Phe Val 

25 30 
Pro Ser Leu Phe Ser Lys Lys Lys Lys Asn Val Thr Met Arg Ser He 

40 45 

I*. Thr Thr Ar g Asp Ar, vu Pro Thr Tyr Gin Tyr A,„ Met As„ P he 

" 60 
Glu Lys Leu Gly Lys Cys lie n e He Asn Asn Lys Asn Phe Asp Lys 

Val Thr Gly Met G , y Val Arg Asn Gly Thr Asp Lys Asp Ala Glu Ala 

90 95 

Leu Phe Lys Cys Phe Ar 9 Ser Leu Gly P he Asp vel He V,l Tyr As„ 

105 11Q 

Asp cys ser Cys Ala Lys Met Gin Asp Leu Leu Lys Lys Ala Ser Glu 

1Z0 125 
Glu Asp His Thr Asn Ala Ala Cys Phe Ala Cys lie Leu Leu Ser His 

1Jb 140 
Gly Glu Glu Asn Val n. T yr Gly Lys Asp Gly Val Thr ProVle Lys 

155 160 
Asp Leu Thr Ala His Phe Arg Gly Asp Arg Cys Lys Thr Leu Leu Glu 

170 175 
Lys Pro Lys Leu Phe Phe lie Gin Ala Cys Arg Gly Thr Glu Leu Asp 



155 160 

Cys Lys Thr Leu Leu 
170 175 

Cys Arg Gly Thr Glu 
185 190 

ASP Gly lie Gin Ala Asp Ser Gly Pro lie Asn Asp Thr Asp Ala Asn 

200 205 

Pro Arg Tyr Lys He Pro Val Glu Ala Asp Phe Leu Phe Ala Tyr Ser 

<!15 220 

Thr Val Pro Gly Tyr Tyr Ser Trp Arg Ser Pro Gly Arg Gly Ser Trp 

235 240 
Phe Val Gin Ala Leu Cys Ser He Leu Glu Glu His Gly Lys Asp Leu 



255 
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Glu He «et Gin lie Leu It. ». £1 *s» Asp X, v.l Us X. His 

260 265 
Ph e Glu S« Gin S«r Asp MP Pro His Phe His Glu Lys Lys Gin lie 

275 280 
Pro Cys V,l Val Ser Met «u Thr Lys Glu Leu Tyr Phe Ser Gin 

290 295 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(tU ^LS^ioSf/oe^^ipcr^i^onucleotide primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CCATCCTAAT ACGACTCACT ATAGGGC 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid oUgorvuc leotide" 
(A) DESCRIPTION: /desc - PCR prime* 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GCAAACTCTG TCAATTCACC C 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(A) DESCRIPTION : /desc = "pro „n , . 

' c PCR oligonucleotide primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID HQS- 

H^ca* ACTGGTGGJVC MaiTOaJecnBora „ 

GACCGGTCCT CG 

(2) INFORMATION FOR SEQ ID NO: 6: ^ 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii} MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = -vrt Jo • 

/aesc - pcr oligonucleotide primer" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GAATTCTAGA TTCTATTGAC TGAAGTAGAG TTC 
(2) INFORMATION FOR SEQ ID NO: 7; 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 21 base pairs ~ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single • 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
oligonucliit?de" RIPTI ° N: /d6SC = "-it-directed mutagenesis 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID 
CATTCAGGCT AGCCGAGGGA C 
(2) INFORMATION FOR SEQ ID NO: 8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

oligonucleotide" 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCTATTACTC AGCTGAGAGC CCAGGA 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other n ^ c ^ e i^ C ^ imer olig onucleotide" 
(A) DESCRIPTION : /desc - PCR primer v y 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

30 

CGGCTCGAGG CTAAGCCAGA CCGGTCCTCG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

UU "ST Ssc T R "?ioSf 7d."= C i e - C P CRla».r oligonucleotide" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
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GGGCTCGAGC TATTGACTGA AGTAGAGTTC 
(2) INFORMATION FOR SEQ ID NO:ll: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

jet Gly ser Ser His His His „ is Hi s His Ser Ser Gl y Leu Val Pro 



Arg Gly Ser His Met Leu Glu 
20 
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We claim: 

1 An isolated DNA sequence encoding an 
active, activatable or inactive CMH-1 polypeptide selected 
from: 

a. the DNA sequence set forth in SEQ ID 

NO : 1 ; 

b. DNA sequences which hybridize to SEQ 

ID N0:1; and 

c. DNA sequences which but for the 
degeneracy of the genetic code would hybridize to the DNA 
sequences defined in a. and b. and which sequences code 
for a polypeptide having the same amino acid sequences. 

2. An isolated DNA sequence encoding an active 

CMH-1 polypeptide selected from: 

a. a DNA sequence comprising nucleotides 

123-962 of SEQ ID NO:l; 

b. a DNA sequence comprising nucleotides 

138-962 of SEQ ID N0:1; 

c. DNA sequences which hybridizes to the 

DNA sequences defined in a. or b.; and 

d DNA sequences which but for the 
degeneracy of the genetic code would hybridize to the DNA 
sequences defined in a., b. or c. and which sequences code 
for a polypeptide having the same amino acid sequences. 

3. The DNA sequence according to claim 2, 
selected from DNA sequences which encode amino acids 24- 
303 of SEQ ID NO: 2 and DNA sequences which encode ammo 
acids 29-303 of SEQ ID NO: 2. 
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4. An isolated DNA sequence encoding an 
inactive CMH-1 polypeptide selected from: 

DNA sequences which hybridize to SEQ 



a 

ID N0:1; and 



b. DNA sequences which but for the 
degeneracy of the genetic code would hybridize to the DNA 
sequences defined in a. and which sequences code for a 
Polypeptide having the sa^e amino acid sequences; said DNA 
sequence encdoding a polypeptide with no detectable 
cysteine protease activity. 

5. A recombinant DNA molecule comprising a DNA 
sequence according to claim 1. 

6. The recombinant DNA molecule according to 
claxm 5, wherein said DNA sequence is 'operatively linked 
to an expression control sequence, said DNA sequence 
capable of berng expressed in a host cell transfromed with 
said recombinant DNA molecule. 

7. A prokaryotic or eukaryotic host cell 
transformed with a recombinant DNA molecule according to 
claim 5. y 



8- A prokaryotic or eukaryotic host cell 
transformed with a recombinant DNA molecule according to 
claim 6. 



9. The host cell according to claim 7 or 8 
wherein said host cell i s a mammalian cell. 

10. The host cell according to claim 7 or 8 
wherein said host cell is a E . coli. 
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11. A method of producing a DNA sequence 
encoding an active, activatable or inactive CMH-1 
polypeptide comprising the steps of: 

a. incubating a host cell according to 

claim 7 in a growth medium; 

b. separating said cells from said 

medium; and . 

c. isolating said DNA sequence from said 

cells. 

12. A method for producing an active or 
activatable CMH-1 polypeptide comprising the steps of: 

a. incubating a host cell according to 
claim 8 in a growth medium under conditions which promote 
expression of said-^MHpl, polypeptide; 

b. separating said cells from said 

medium; and 

c. isolating said polypeptide from said 

cells . 

13. An active or activatable CMH-1 polypeptide. 

14. An antibody to a CMH-1 polypeptide selected 
from polyclonal antibodies, monoclonal antibodies and 
antibody fragments capable of binding to a CMH-1 
polypeptide . 
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